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Abstract 

We consider a stochastic A^-particle model for the spatially homoge- 
neous Boltzmann evolution and prove its convergence to the associated 
Boltzmann equation when N — > oo. For any time T > we bound the 
distance between the empirical measure of the particle system and the 
measure given by the Boltzmann evolution in some homogeneous nega- 
tive Sobolev space. The control we get is Gaussian, i.e. we prove that the 
distance is bigger than xN^^^'^ with a probability of type 0{e^^ ). The 
two main ingredients are first a control of fluctuations due to the discrete 
nature of collisions, secondly a Lipschitz continuity for the Boltzmann 
collision kernel. The latter condition, in our present setting, is only sat- 
isfied for Maxwellian models. Numerical computations tend to show that 
our results are useful in practice. 



Introduction 

The Boltzmann equation was written down by L. Boltzmann |f2l in 1872, five 
years after Maxwell's seminal paper [13], to describe the behaviour of a large 
number of gas molecules interacting by pairwise collisions. Proving rigor- 
ously the heuristic arguments of Boltzmann to get some convergence of the 
A^-particle model to the continuous Boltzmann equation when N — > oo is 
an extremely difficult challenge (it motivated Hilbert's 6th problem [11]) that 
mathematicians are still dealing with. 

Here we are only going to handle the spatially homogeneous Boltzmann 
equation (also called mean field Boltzmann equation), in which one forgets 
the positions of the gas particles to concentrate only on the collision phe- 
nomenon. Then proving the convergence of an A^-particle system to the con- 
tinuous equation is a typical mean field limit problem: a particle model is 
said to be mean field when each particle interacts with comparable strength 
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with all the other ones. Such a problem, which was first proposed by Kac lfT2ll . 
is far more tractable than the original one, and convergence results, mostly 
qualitative, have already been obtained for it (see § 16. 4D . Here however we are 
interested in a quantitative version of these results. 

The goal of this paper is not only to get some quantitative results of conver- 
gence for spatially homogeneous particle collision models, but also to get an 
iV~^/^ convergence speed, typical of the uniform central limit theory (see [8j| 
about that theory), and to prove some non-asymptotic bounds. Concerning 
"real" Boltzmann models, in the actual state of my work I am only able to use 
the results for Maxwellian systems, and moreover constants in convergence 
bounds deteriorate rapidly with time. However that does not seem to be a 
fundamental feature of my approach, and I am currently working on further 
improvements to overcome these issues. 

Here is some notation which will be used throughout this paper: 

• The space IR!^ is equipped with its Euclidean structure, whose norm is 
denoted by | • | . 

• f : E — > F being a measurable function and /i a measure on E, the 
image measure of by / on F will be denoted 

• 6x denotes a Dirac mass at x. 

• S(R'^) is the Schwartz space on R"*, i.e. the set of (complex-valued) C°° 
functions on K'^ which tend to at infinity faster than any |x| as well 
as all their derivatives. 

• The Fourier transform of a function / G iS(]R'^) is denoted by /, with the 
unitary convention f{^) = (27r)"'^/^ J^^ /(a;)e~*^'''ck. 

• The notation || ■ || will be used to denote Hilbert norms in functional 
spaces. If Q is a linear operator between two Hilbert spaces, its oper- 
ator norm sup||^.||^i \\Qx\\ will be denoted HQf. 

• x,y and z being three points of an affine Hilbert space with y,z ^ x, yxz 
denotes the angle between xy and xz, which is an element of [0, tt]. 

• The identity matrix of size d is denoted 1^. 



1 The model 

1.1 The microscopic model 

Let us describe the particle model for the spatially homogeneous Boltzmann 
evolution. Such models have been first proposed by Kac [12] and later thor- 
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oughly studied by Sznitman Il20ll . Spohn and others. There are iden- 
tical particles indexed by 0, . . . , — 1, each particle i being characterized by 
its velocity Vi G R"*. One imposes random collision times, so that the micro- 
scopic evolution is a Markov process. The way two particles with respective 
velocities v and w hit each other is described by some positive measure 7„ „, on 
N~^&^y^yj{v' , w') being the collision rate from state (f , w) to state [v' , w'). 
In other words, the generator C of the Markov process is 

Cf{vo, . . .,vn-i) = 

^ E / {-f{Vo,...,VN-l)+f{...,v'„...,v'^,...))&r,^,,^{v',,v'^). (1.1) 

We may add to this model some extra physical conditions. First, we will 
always suppose that the momentum and energy are conserved by collisions, 
and that the model is invariant by velocity translation or rotation, i.e. for all 
v,w e K'^, for any (positive) isometry J of R'^; 

7t,,i„-a.e. v' + w' = V + w, (1.2) 
7t,^„,-a.e. \w' — v'\ = \w — v\, (1.3) 

Jjv,Jw = {J, J)*1v,w (1-4) 

When conditions ( I1.2D to ( I1.4D are satisfied, the model is completely described 
by the family of measures (7n)ug(o oo) where d7„(6') is the proportion, 

by unit of time, of particles with relative speed u which undergo a collision 
making them deviate by an angle 9 in the collision referential. 

Moreover, it is often assumed that the 7„ have a scale invariance property, 
in the sense that there exists a real parameter g such that for any A G (0, +oo), 

Ixu = A^7.- (1.5) 

The hard sphere model corresponds to the value g = 1. Another very in- 
teresting particular case is when g = — then one says that the model is 
Maxwellian. In this article the concrete results obtained actually will concern 
Maxwellian models. 

Before turning to the macroscopic model, let us make some remarks on 
our microscopic model: 

1.1 Remark. 1. The factor in Equation dl.ll) is essential to get the 
mean field limit: it morally says that the global collision rate of one 
particle is independent of the total number of particles. 

2. Strictly speaking, generator dl.lD allows a particle to collide with itself, 
which is physically absurd. Yet because of the conservation law (II. 3D . 
the auto-collision term is actually zero, so there is no problem. 
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3. The 7t, „, have to satisfy some integrabihty conditions for the Markov 
process to be well-defined. For instance, if conditions ( I1.2D to dl.SD are 
statisfied, then it suffices that for an arbitrarily chosen u G (0, oo), 
/o"^^-id7„(0) is finite El. 



1.2 The macroscopic model 



The macroscopic space-homogeneous Boltzmann equation IH is obtained in- 
formally by letting N tend to infinity in the microscopic evolution. Then the 
particles' velocities are described by they empirical measure, which is a (pos- 
sibly non- atomic) probability measure iit on R'^. The evolution of that measure 
is deterministic and is governed by the equation: 



Dtfi = Q{fit,fJ't), (1.6) 
where Q is the Boltzmann collision kernel of the system, formally defined by: 

Q{n, ^) = ^ jiyj {- ^v- 5w + 5^' + 5u,')d-i^^ui{v' , w')^ (]^{v)diy{w). (1.7) 



Equation ( I1.6D is an ordinary differential equation in an infinite-dimensio- 
nal space; that equation is non-linear because of the quadratic term Q{fi,fi). 
Unique existence of a solution to it has been thoroughly studied over the last 
decades BH [Mil. For our theory to work, we will need to work in a setting 
where that unique existence is achieved in some convenient space — which 
is quite logical altogether. Later we will see concrete examples where ( I1.6D 
behaves well for our purpose. 



1.3 Conservation laws, convergence to equilibrium 

Because of the conservation laws ( I1.2D and dl.SD . we get d + 1 invariant func- 
tions for the microscopic system: the first d are synthetised in the momentum 
P = ^2^=0^ the last one is the energy K = ^ J2iLo^ macro- 

scopic model, these invariants become p = J vd^{v) and k = \j \v\'^ dii{v) . 
Moreover the fact that the macroscopic model derives from the description 
of an evolution of particles implies two extra properties for it: first positivity 
of Equation ( I1.6D . which means that if /iq is a positive measure, then so are 
the for t positive; secondly conservation of mass which gives the {d + 2)-nd 
invariant m = J djj,{v) for the macroscopic equation. 

Concerning equilibrium, if we impose some minimal non-degeneracy con- 
dition (see [21]), then it is a well-known beautiful result due to Boltzmann [2]| 
that Equation dl.GD is dissipative for positive measures and then converges 
to an equilibrium measure /Xeq depending only on p, k and m: for m = 1 and 
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p = 0, it is 



d\v\'^ 



(1.8) 



and it has the invariance properties /ieq(p, k, 1) = rp#/ieq(0, k — 1), being 
the translation by vector p, and f^eqi^P, ^k, Am) = Xfieqip, k, m). More recently 
a beautiful quantitative version of that convergence result has been proved 
by Carlen, Gabetta and Toscani ||5]]. 

For the microscopic model, there is also a unique ergodic equilibrium mea- 
sure for each value of P and K {N being fixed), which is merely the uniform 
measure on the {dN — d— 1) -dimensional sphere^j of (R'^)^ made of A^-uples of 
vectors having these P and K. Note that if there are particles with momen- 
tum Np and energy Nk, the marginals of that measure tend to the continuous 
equilibrium measure fieq{p, k,l) when N — > oo. 

It is worth recalling that the microscopic process is reversible under its 
equilibrium measure, while on the contrary the macroscopic equation ( I1.6D 
exhibits a dissipative behaviour — a phenomenon which caused much trouble 
at Boltzmann's time, but has been well understood today. 



2 Homogeneous Sobolev spaces 
2.1 Why homogeneous Sobolev spaces? 

To be able to speak of quantitative convergence we will work in some Banach 
space. Which one will we take ? As we want to compare the empirical measure 
of our particle system to its limit evolution, a natural choice is to take some 
coupling distance between measures — say, the Wi Wasserstein distance Il22|. 
§ 7], defined for ^, v two positive measures with the same mass by: 



where "/ 1-Lip." means that the supremum is taken over all 1-Lipschitz func- 
tions on W^. However it turns out that it is hopeless to get an N~^l'^ rate 
of convergence in such a space, because testing v — against so much test 
functions makes the uniform central limit theory fail — see [8, § 6.4] for more 
details. 

Thus the idea is to test v — ^ against a smaller space made of more regular 
functions. Sobolev spaces 1^'^, s > 0, are such natural test spaces; then 
V — ^ will be seen as an element of the dual space For our theory 

we will have to work in a Hilbert space, so we will take p = 2 and work 

**' Possibly of radius 0. 



W\{\i^ z/) = sup 

/ l-Lip. 




(2.1) 
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in 1^"'^'^ = then we can take s fractional, and it will turn out to be 

useful indeed. Yet since defining a norm for spaces requires to choose 
some aribtrary length, which is physically annoying, we will rather consider 
homogeneous H^" spaces, which among other advantages do have a canonical 
norm. Let us define these spaces properly. 



2.2 Definition and useful properties 

2.1 Definition. Let seR, and for / e set 

\\f\\H-^^ = (^Jjm\'\^\-'''^y' ■ (2.2) 

Then those of the / G 5(R'^) for which < oo, equipped with the norm 

II ■ \\h-^, constitute a pre-Hilbert space with scalar product 

{f,9)H-s= [ mm\^\-''<^- (2.3) 

The Hilbert space obtained by completing it is denoted if ^ 

2.2 Remark. For a physicist, / : K'^ — > C has some homogeneity: say, the 
elements in IR°' are measured in x (generally x is a unit of length, say meters) 
and the elements in C are measured in y (which will often be a density, say 
kg . m^'^). Then ||/||if-s is measured y ■ x^'+'^Z^ (in our example, ||/||i/ = would be 
measured in kg . m^~'^/^). Equivalently, if ;U is a measure on H'^, the physical 
dimension of ||/i||if ^ is z x'* '^Z^, x being the physical dimension of the elements 
of K'^ and z the physical dimension of jj, (which in our example would be kg). 

As we recalled in § 12. 1|, bounding a function or a measure in H^^ means 
bounding uniformly its integral against some class of regular functions: 

2.3 Proposition. Define in the same way as H^^, then, for any f for which 
it makes sense: 



Wh-s = sup 



f{x)g{x)dx 



(2.4) 



2.4 Lemma. For s g]0, d[, let tps be the locally integrable function 

= \x\~^''-'\ (2.5) 

then one has for all f,g G S{'R'^): 

(/, g) = c(s, dY{f *^s,9* 0s>^2(i^d), (2.6) 

with 

T((d-s)/2) 

r(.) being Euler's Gamma function. 
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Proof. Use that the Fourier transform of |,^| * is (27r)'^/^c(s, d)(j)s{x), cf IfTTj ex- 
ercise V-10]. □ 

2.5 Claim. Let J\ be a similarity of R,"' with dilation factor X, then for any 

map f G H^'^, 

\\foJx\\H-^=y^''^'\\f\\H-s. (2.8) 
Equivalenty, for any measure /i G ^ 

\\J^*^^\\H-s = \'~'"^MH-s. (2.9) 



fl From now on, we will always implicitly write s = d/2 + r. 

2.6 Proposition. Suppose d ^ ^ and let fi be a compactly supported signed 
measure on H'^ with total mass 0, then for any r G (0, 1), /i can be seen as an 
element of H~^. 

Proof. Thanks to Lemma 12.41 we just need to prove that yu * 0^ is a square- 
integrable function. Suppose that yu is supported by the ball B{R) of radius R 
centered at and splits into /i+ — yu_ with yu+ and /i positive measures each 
of total mass M. Then for p > 0, on B{p), * 0^ is equal to p * (1b(r+p)0s), so 
the norm of lB(p)(/i * (ps) is bounded above by 2M • \1b{r+p)<Ps\l'^ < oo. Thus 
/i * 0s is locally L^. On the other hand, for |x| = p > R, 

SO jj* (j)s is at infinity, which finishes the proof □ 

2.7 Corollary. Still suppose d ^ 2, then for r G (0, 1), any signed measure 
with zero total mass, if it has an r-th polynomial momentum, can be seen as 
en element of H~^. 



Proof. Let p = /i+ — p be such a measure with its Hahn decomposition, p+ 
and each having total mass M. Then the integral Minkowski inequality 
gives 



M ./(]Rd)2 

C r 



M 



X — y\^dp+{x)dp_{y) < oo, (2.11) 



Cr being the H norm of any 6x — Sy for \x — y\ = 1, which is finite by the 
previous proposition. □ 



The proposition remains valid with d — I, except that it must be demanded that r < 1/2. 
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2.8 Remark. Note that the if * norm allows us to measure the distance be- 
tween two (sufficiently integrable) probability measures, however speaking 
of the ii"* norm of a single probability measure would be nonsense! Note 
also that, by Sobolev imbedding, one can bound above \\u — yul^f s, for any two 
probability measures fi and u, by (up to some explicit multiplicative constant) 

W^i,,(/x,z/)=sup{| [fdfi- [ fdi^\-Wx,y\f{x)-f{y)\^\y-x\'}. (2.12) 



3 Dynamic control 

3.1 Abstract setting 

Now let us study the evolution of our particle system along time. For the sake 
of elegance we are going to state our results in an abstract setting first. 

Let H he a Hilbert space (to be thought of as '*(R,'^)), and let {Xt)t^o 
be some jump Markov process on that Hilbert space {Xt has to be thought 
of as Ji^, the empirical measure of process dl.lD ). with generator C. Stricto 
sensu C acts on some space of real functions on H, say the space of continuous 
bounded functions Cb{H, R), but its definition can be straightforward genere- 
alized to the space Cb{H,E) for any Banach space E, defining the operator 
: CbiH, E) — y Cb{H, E) through: 

yfeCbiH,E) (0,/:(^)/) = /:((0,/)). (3.1) 

By abuse of notation we will still denote C for C^^\ thus giving a meaning to 
expressions like CI, I being the identity on With that notation, define 
{Xt)t^o as the deterministic process on H following the differential equation: 

DtX = {Cl)iXt). (3.2) 

Equation (lO) has to be thought of as (fLGl) . 

Our goal is to control the distance between Xt and Xt. Here what is 
important for us is to have a good control of large deviations for that dis- 
tance. As Cramer's method cannot be applied directly because of the infinite- 
dimensional setting, we introduce an exponential utility function U : H — > R 
defined as 

= e»"» +e-»"'. (3.3) 
The following claim gathers the properties of U we will use in our work: 

3.1 Claim. 1. For all x e H, U{x) ^ ei^'; 

2. W(0) = 2; 
In that case E = H. 
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3. For all x,heH, U{x + h)^ e^''\\U{x); 

4. U is of classed 

5. For all x E H, VU{x) is positively colinear to x; 

6. For all x e H, \\\V^U\l ^l{{x). 

Then one can state the theorem which will be our central tool. We first 
need some notation to alleviate our formulas: 

3.2 Definition. We denote ei(t) = (e* — l)/t, extended by ei(0) = 1, resp. 

e2{t) = (e* — 1 — extended by 62(0) = 1/2. We also denote /t_ the negative 
part of K, i.e. = max{— k, 0}. 

3.3 Theorem. Suppose that Equation ( f3.2l > has a n-contracting semigroup for 
some K eli, in the sense that for all x,h e H: 

{D^{CI)-h,h) ^-n\\hf. (3.4) 

Suppose moreover that the Markov process — which we recall to be a jump 
process — has the amplitude of all its jumps bounded above by some L < 00, 
and satisfies: 

^xeH C{\\--xf){x) (3.5) 

for some V < 00. 

Then, denoting Xq the (random) initial value of the Markov process and Xq 
the (deterministic) initial value of the differential equation i3.2i . one has for 
any T ^ 0, for any X > 0: 

\nE[U{X{XT - Xt))] 

^ lnE[W(Ae-"^(Xo - Xq))] + X^e2{Xe^^-^ L)ei{-2KT)VT. (3.6) 

Proof. The principle of the proof is merely to show that some time-depending 
functional 

F{Xt) = e^^''^u(Xe^^'-^\Xt - X^)), (3.7) 

for a well-chosen function h, is a supermartingale. 

fl To make our computations completely rigorous, throughout the proof we 
will assume that the expected number of collisions per unit of time is uniformly 
bounded, that is, that there is some M < 00 such that \{C1a}{x)\ ^ M for all 
Borel subset A c H and all x e H. Then the general result can be recovered by 
a standard truncation argument. 

Let us fix some t e [0, T] and suppose {Xt')t'(^[o,t] is known. Let & be a small 
amount of time devised to tend to 0; 0(&") will denote any quantity bounded 

To see it, note that U{x) = /(||a;p) where / = 2cosh(y^) is C°° on the closed interval 
[0, +00 [ — actually / can be extended analytically to the whole of R. 
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by some C&" when 6t tends to 0, where C depends only on k, V, M, X, T, t and 

ll^tll- 

With this notation, the law of Xt+st depends on {Xt')t'e[o,t] only through Xt, 
and our goal is to show that E[F(Xt+a)]— which is 0{&), is nonpositive — 
more precisely, we only need to prove that E[F(Xt+s)] — F{Xt) ^ 0(&^j*l3. 

Set Y^= X-X. Denote 5X = Xt+^-Xt, resp. 5X = Xt+^-Xt, <# = Yt+st-Yt, 
5F = F{Xt+st) — F{Xt). The fundamental observation is that 

E[(5X] = {Cl){Xt)8t + 0{8t^). (3.8) 

Now, admitting temporarily that h will be of class C^, we write: 

5F = h'{t)F{t)8t (3.9) 
+ e^WAe''(*'^)VW(Ae"(*-^)ft) • {CI{Xt) - CI{Xt) + KYt)5t (3.10) 



+ e 



hit) 



(3.11) 



+ 0(«2). 



In that sum we first see that the term ( I3.10D is nonpositive: ( 13. 4D indeed 
implies, for all x,y e H, 

{{CI){x + y)- {CI){x) + Ky,y)^0, (3.12) 

which we apply here with x = Xt and y = Yt, using that VU{Xe'^^^~'^^Yt) is 
positively colinear to Yt (Claim [3^11 15]). 

Now let us look at term ( 13. IIP . Because of (13. 8D . the expectation of the 
random variable 

Ae'^^*-^) - {Yt + [CI{Xt) - CI{Xt)]d)) (3.13) 
is 0((S:^). We will use it thanks to the following 

3.4 Lemma. Let X e H; let y he a random variable with zero mean. Then one 
has: 

E[W(X + y)] ^ U{X) (1 + E[e2(||y||) \\yf]). (3.14) 
Proof of the lemma. Taylor's formula yields 

U{X + y)= U{X) + VU{X) -y+i^j^ (1 - 0)V^U{X + ey)dB^ ■ {y ® y). (3.15) 
In that sum the third term is bounded above by 

\\yf [ {i-e)u{x + ey)de (3.16) 

Jo 



Beware that "expr. < 0((i")" does not mean "expr. ~ 0(A")" but actually "(expr.)^ 
0((»")". 
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by Claim I3.1l l6t which in turn is bounded by 

\\yfU{X) f\l ~ e)e'\\y\\&) = e2{\y\)\\yf. (3.17) 

by Claim 13.11 (31 Taking expectation gives the result since the second term in 
sum ( I3.15D has zero mean by assumption. □ 

What does it give for us? Let A be the event "some collision occurs between 
t and t + (*". A is an event of probability 0((*), on A, (I3.13D is 0(1), and 
on A it is 0{5t). Hence, denoting temporarily * for that random variable, 

E[|| * Ipeadi * ID], up to some 0(«2), is merely \^e^<'-^^¥.[\Sffe2{\\e<'-^^^\)\ , 
which is bounded above by A^e^'*'^*~^)e2(Ae^''-^L)y uniformly in t. 

Putting all things together, we get 

E[5F] ^ (h\t) + A2e2"(*-^)e2(Ae2'^-^L)r)F(t)« + 0(«2), (3.18) 

which will be ^ 0{8t^) provided 

h'it) ^ -W^<'-^^e2{Xe^''-^L)V. (3.19) 
To achieve that optimally with h{T) = 0, we choose 

hit) = A2e2(Ae2«-^L)ei(2/t(t - T))V{T - t), (3.20) 

which is of class indeed. Formula ( 13. 6D then follows by the supermartingale 
property. □ 

3.5 Remark. Strictly speaking our proof only shows that F{Xt) is a local su- 
permartingale. But this local supermartingale is nonnegative, so it is actually 
a global supermartingale (see 1.16, § IV-1.5]). 



3.2 Application to Boltzmann's model 

To apply Theorem [3^3] to our Boltzmann model, we have to compute the values 
of L, V, K and E[W(Ae~''^(Xo — -'^o))] • In this subsection let us just look at the 
first two quantities — the last two ones will be the objects of separate sections. 

fl From now on, when dealing with Boltzmann models we are working in 
the space H^^iMf^) for some r G (0, 1). We denote by Cr the H^^ norm of any 
— for \x — y\ = 1, which is some finite explicit function of d and r. 

Recall that K denotes the energy of the A^-particle system, which is con- 
served along the stochastic evolution — note by the way that up to translating 
the origin of R"', we can replace K by the internal energy 

K = K - ^. (3.21) 
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Then at any time no particle has speed greater than \JlK, so the effet of a 
coUision between two particles on the empirical measure can be no more than 
2 • (8-ft')^^^Cr.A^~^, which gives an admissible value for L. 

3.6 Remark. To get the bound L ^ 2 ■ {SKy/'^CrN-^ we have used that the 
relative speed between two particles is at most 2y/2K, and that the effect of a 
collision with relative speed u is at most 2u^N~^. Actually one can do better: 
the relative speed between two particle is at most 2v^ and the effect of a 
collision with relative speed u is at most 2^2^"'' — 1 CrU" /N, so we could have 
taken 

L = 2i+V2i-^' -lCrK''/^N-\ (3.22) 

It is that bound that we will use in the sequel. As it does not change qualita- 
tively the results compared to the rough value L = 2 ■ {SKy/'^CrN^^ , I let the 
proof of ( I3.22D as an exercise for the reader. 

Anyway remember that, since K is going to be of order of magnitude 0{N), 
one has L = O^N''/"^-^) when N — > oo. 

Now let us compute V, which has been defined by ( 13. 5D . V must be a 
bound for the expectation by unit of time of the square of the jumps done by 
the process X, which, denoting X = jj, for the Boltzmann model, is bounded 
above by 

2Cr'^N^^ I Iw - t;|2"d^(t;)d/i(w) 

i(Rd)2 

Jensen „ , / /" „ \ 

^ 2Cr^N-^ / \w- v\^d^x{v)d^i{w) 

= 2i+2-c,2iV-i j ^2i+2^a'i^"iV-(^+'-). (3.23) 
Taking into account Remark l3.6|. one can even take 

V = (2^^" - l)2l+2r^^2^rjy-(l+r)_ (3^24) 

AnjTway remember that V = 0{N^^) when N — > oo. 



3.3 Comments on the results 

fl All the computations in this subsection are purely heuristic, so we will 
drop lower order terms without wondering when we can do so. Ci, C2, . . . will 
denote constants depending only on k, V, L and T, whose exact expression does 
not interest us. 

In the right-hand side of Formula ( 13. 6D there are two terms: the first one, 
lnE[W(Ae~''^(Xo — Xq))], merely expresses the difference between the experi- 
mental initial condition and its continuous limit. There is obviously no sur- 
prise in getting such a term, whose study is deferred to §[5l For the time being 
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just notice the presence of the factor e in front of Xq — Xq, which means 
that the effect of initial fluctuations will be quite large if /t < 0, and conversely 
quite small if k > 0. 

The actual dynamic effect in (I3.6D lies in the term A^e2(Ae^''-^L)ei(— 2/tr) 
VT. Let us study it in the case of our Boltzmann model, according to § I3.2[ 
We have noticed that, when becomes large, one has L = 0{N''/'^^^), resp. 
V = 0(iV 1). So let us write L ~ m^/'^-\ resp. V ~ ujN-\ Then the dynamic 
term of ( 13. 6D becomes: 

X^e2{\e^''-^L)ei{-2KT)VT ~ \^N-^e2i\e^"-^iN'-/^-')ei{-2KT)ujT. (3.25) 

The A^A^^ factor hints that the good order of magnitude for A will be A = 
0{N^/^). So write A = yN^/^; then (13:251) becomes 

A2e2(Ae'"-^L)ei(-2/€T)yT ^ e2{ye^''-^iN^'~'^/^)ei{-2KT)ujy^T. (3.26) 

In our case (r — l)/2 < so, if is sufficiently large, ye'^'^-'^iN^^^^^^'^ is very 
close to zero and the e2(*) term is very close to 62(0) = 1/2, finally giving 

X^e2{Xe^^-^ L)ei{-2KT)VT ~ ^ei{-2 KT)uy^T. (3.27) 

For a fixed T, ( I3.27D shows that the dynamic term in Formula ( 13. 6D 
is approximately Ciy"^. Moreover, as we will see in §[5l the static term 
In E[U{Xe~'^'^{Xo — Xq))] is approximately C2?/^ + C3. In the end, one gets 

\nE[U{yN'/\Xt - Xt))] < C,y^ + C3, (3.28) 

hence by Markov's inequality and Claim [STTI fTI. for all x > 0, 

p(^yN^/^\\XT - Xt\\ ^ x) < e^^y'+^'-''. (3.29) 

Optimizing Formula ( I3.29D for fixed x/y ratio, one finally finds: 

Ve^O P{\\XT-XT\\^e) <exp{C3-C5Ne^). (3.30) 

So Theorem 13.31 applied to the Boltzmann model gives a Gaussian control 
for the fluctuations between Xt and Xt for any fixed value of T — provided 
the existence of some contractivity constant k, which will be proved for the 
Maxwellian case in §[4l Moreover the order of magnitude of the fluctuations 
we get is A^^/^, the typical deviation size in central limit theorems. So we 
may say that the bounds we have got are a kind of explicit dynamic central 
limit bound for the Boltzmann model. 

3.7 Remark. Actually the approximations we did to get ( I3.28D are sensible 
only if y is not too large, otherwise Ae^''-^£A^''/^^ > 1 and then the e2(*) term 
in ( I3.25D cannot be considered as close to 1/2. It follows that our computa- 
tions are valid only for A < A^^ '/^/£, i.e. for y < N^'^^'''^^^ /i. Tracking that 
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constraint throughout our reasoning, it finally turns out that 03. SOD is only 
valid for e < TujN~'^/^/i. So our Gaussian control does not hold up to large de- 
viations but only to intermediate deviations. Fortunately (I3.30D tells us that 
the probability of such intermediate deviations is bounded above by some- 
thing like e"*^*^^ , which goes very fast to anyway. Moreover, even for 
£ > iuTN-^/'^/i one can still use (l3^ with y = N^^-'-y^/i and k ~ yN^/^e, 
which gives an exponential control of the tail of the law of \\Xt — Xt\\ applica- 
ble to large deviations. 

The behaviour of Formula ( I3.27D as T becomes large depends on the value 
of k: 

• If K < (the worst case), then the ei(— 2fi:T) factor becomes exponen- 
tially large as soon as T > Thus the dynamic control given by 
Theorem 13.31 is relevant only for moderate values of T corresponding 
to durations for which each particle makes only a couple of collisions. 
Moreover, as we noticed in the beginning of that subsection, in that case 
the term due to the control of initial fluctuations will also become huge 
as T increases. Note however that qualitatively we get a Gaussian con- 
trol for any fixed T, only the constants in that control becoming bad. 

• If K = the dynamic term of ( 13. 6D increases proportionally to T, so our 
bound remains good even for moderately large values of T, but ulti- 
mately gets uninteresting. 

• If K > (the best case), then Tei{-2KT) ~ 1/2k when T — > oo so 
the right-hand side of ( 13. 6D remains bounded uniformly in T, implying 
that the A^-particle model approximates well its continuous limit for any 
time0. Note that k > is tantamount to having an exponential con- 
vergence of ( I1.6D to equilibrium in H~^, so in that case our bound rather 
looks like a result of convergence to "equilibrium" for the empirical mea- 
sure Ji^. 

4 Contractivity of the collision kernel 
4.1 Limitations due to our settings 

In this section we are going to look for computing constant k in ( 13. 4D . Un- 
fortunately it turns out that, for the choices we have made, our results are 
unavoidably limited, as we quickly explain in this foreword. Let me stress 

Of course, it does not mean that one random particle system has large probability to stay 
always close to the continuous limit — which is trivially false by ergodicity — but actually that 
at any given time, most of the particle systems will be close to the limit. 
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however that all the issues encoutered may be solved by working in a trickier 
space than the mere if "* space. 

First K can only be negative, which is the worst case (see page fl4b . Why 
that? Well, if n were positive, as we said previously it would imply conver- 
gence of Equation dl.GD to a unique equilibrium for all probability measures. 
Yet there are several different equilibrium probability measures for the Boltz- 
mann evolution (see Formula dl.SD and below), whose differences lie in 
which is a contradiction. So k is nonnegative. Then we could prove, using 
that the model is nondegenerate, that k cannot be zero and thus is positive. 
To have a chance to get negative values of k, H^'' should be replaced by a Ba- 
nach space which only contains signed measures such that / //(cb), / xri{dx), 
J |a;p?7(dr) = — but which one? 

Secondly, the only chance for k to be finite is the case of Maxwellian 
models (remember definition below dl.SD ): as will be seen later, this is due 
to a bad scale invariance property for non-Maxwellian models. Though the 
Maxwellian case is often a useful first step for theoretists, the physical mod- 
els encountered in real life do not have any reason for being so! To have a 
chance to get results for non-Maxwellian models, H^^ should be replaced by 
some non-homogeneous space — but non-homogeneous spaces are often less 
tractable than homogeneous spaces and more difficult to interpret physically. 



4.2 Principle to the computation of k, 

Some readers may have jumped when reading hypothesis d3.4D . How can such 
a linear regularity hypothesis apply to the nonlinear collision kernel •)? 
The trick is that, because of positivity and conservation of mass (see § 11. 3D . 
one can consider the Markov process as restricted to the space of probability 
measures (with r-th polynomial moments), which is some closed subset of an 
affine i/ * space (recall that a probability measure itself is not an element of 

4.1 Lemma. If, for one arbitrary (then for all) v E H'^, the linear operator 
Q{5y, ■) : H^^ — > H^^ satisfies the "contractivity" property 

^feH~^ (Q(5„,/),/)^_|||/p, (4.1) 

then the restriction ofQ to the probability measures with r-th moment satisfies 
hypothesis 43. 

Proof Hypothesis d3.4D for the probability measures with r-th moment just 
means that for all probability measures fi, for all / G H~% 

(2Q(/i,/),/)^-«:||/f. (4.2) 
That follows directly from d4.1D by writing the integration formula jj = 
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4.2 Remark. It is not difficult to see that conversely the best k possible in ( 13. 4D 
is exactly the best k possible in 04. ID . We do not prove it as it is not essential, 
but it will be implicitly used in Remark [4341 

4.3 Lemma. Recall definition ( [2.5P of(j)s. Note {*(j)s) the convolution operator 

^"^'-f,. (4.3) 

Then Q{5y,-) : if ^ — > H^^ satisfies property (EZP if and only if 

(*0s) o Q{6,, ■) o ■.L^—.L^ (4.4) 

satisfies the same property in the space L'^(R'^). 



Proof. It follows directly from the isomorphism formula ( 12. 6D . □ 

4.4 Remark. Now we can understand why k cannot be finite for a non- 
Maxwellian model: suppose the model satisfies dl.SD with g ^ 0. For A G 
(0, +oo) denote by Ix the homothety transforming v into Xv. Then you get 

Q (5o, /A#yu) = X'h#Q (5o, /i) , (4.5) 

so if Q{Sq, ■) were ^-contracting for a /t < it would also be A^K-contracting for 
all A, thus -contracting, which is impossible. 



4.3 Effective computation 

4.5 Lemma. Let 6 e [0, vr]; define a linear operator Qq on the set of measures 
on H^, such that Qjidv) is the uniform probability measure on the {d — 2)- 
dimensional spherS*^ of velocities v' such that \v' — v/2\ = \v\/2 and v^v' = 6. 
Then 

(*0.) o Qe = {cosie/2)YQg o (4.6) 

4.6 Remark. Qe{Sv) respresents the post-collisional distribution of velocity of 
a particle at initial velocity v which has collided with a particle at initial 
velocity 0, undergoing an angular deviation 6 in the collision referential, the 
precise direction of that deviation being random. 



Proof. First let us give a neat proof working when d is even. Call TZg the set of 
the rotations R of R*^ satisfying vO{Rv) = 6/2 for all v e R"*. If d is even, TZg is 
non-empty and has some canonical probability measure 7ig equipping it. Then 
we notice that 

Q,(/i)= / [cos(^^/2)i?]#/id7r,(i?). (4.7) 
That sphere degenerates into a point if6e {0, tt}. 
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Because of the rotational in variance of 0s, for any R e Re, 

{*(!),) oR* = R#o{*(j),). (4.8) 
Similarly, the scale invariance of 0s makes that for any A G (0, oo), 

(*0s)o/a# = A^-/a#o(*0s). (4.9) 

The result then follows by applying Formulae 04. 8D and (14. 9D to the inte- 
gral (l4?7l) . 

When d is odd unfortunately I have nothing better than a calculation — 
which also works for even d. Choose an arbitrary w > 0, we will prove that 
{Qe^v) * 4>s = Qei^v * 4>s), where v denotes the point {v,0, . . . ,0) e IR!^ whenever 
that makes sense. Since these two functions are invariant by any rotation 
around v, we will locate a point in K'^ merely by its first coordinate z and its 
distance p to the z axis; we will also denote Z = ^/z^~+~f^ its distance to 0. In 
the following calculations S denotes the unit sphere in R'' \ equipped with its 
Lebesgue probability measure a, and p denotes the point (p, 0, . . . , 0) G R'^^ 
whenever that makes sense; points of S are denoted y = {yo,yi) with y^ g R, 
yi G R''"^. Treating (QeSy) * 0^ as a function, we find: 

{{QeSv) *(ps){z,p) 

= / |(cos(^^/2)t;-z)'+ (sin^yo^/2-p)' + (sine)2|yi|V/4| Mvo.yi) 

= y |z2 + cos(^/2)V -2cos(^/2)2t;2-sin^wp?/o} ^'^ '^'"^ ^{y^^yx)- (4.10) 

For Qe[by*<^^ it is more complicated since that case needs computing a expres- 
sion of type Qej , f being a function. Usually that kind of computation raises 
no difficulty, but here the operator Qe has some singularity which makes it 
less tractable: in Qef, the "mass" (in the measure sense) received by the point 
(2, 0, ... , 0) comes only from a (d— 2) -dimensional sphere in R"' — more precisely 
the sphere of points {z, p), p & R'^ ^ with |p| = ta.n{9/2)z. That regularity prob- 
lem can be overcome by an approximation technique, yielding: 

(Qef) 0, . . . , 0) = ^^^^j^, [iM0l'^)Ay)My) (4.ii) 

— that formula also allowing to compute Qef at points not located on the z 
axis by rotational invariance. 

So 

(Qe(5.*0s))(^,p) = cos(0/2)-'^- 

/ {(z - tan(e/2)/>yo - vf + (/> + tan(0/2)zyo)' + t^O 1'^? Z^bi?] '^'^ '^'^ <^(T{yo,yi) 

= cos(0/2)-'^ / {(l + tan(0/2)2)z2 -2w(z-tan(0/2)/9yo) +^^^}~^'^~'^^^dcj(yo,yi) 

= cos(0/2)-^((Q05,) * 0s) P). (4.12) 

□ 
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4.7 Corollary. Let Qe = Qe + Qn-e — Qo — Qn- Then, for any f e H \ 

{Qef J) ^ [{cos{e/2)Y +{s\n{e/2)Y -I] ■ \\ff. (4.13) 

Proof. Observe first that Qo is the identity and that Qtt = 0, so it suffices to 
prove that the operator norm of Qe in ij is bounded above by ( cos(6'/2))'^. 
Because of isomorphism formula ( 12. 6D . that is also the norm of o Qe ° 
{*4>s)~^ in L^, which is cos{6/2yQe by the Lemma |431 Thus we just have to 
bound the norm of Qe, regarded as an operator in L?', by cos(^^/2) '^Z^. Now we 
note that one can write _ 

Qef = Icos(e/2)#Qef, (4.14) 

where Qe is the kernel of the Markov chain on H'^ which sends x uniformly 
to the {d — 2) -dimensional sphere of points y such that \y\ = \x\ and xOy = 
6/2. But that Markov chain has the Lebesgue measure on K'^ as reversible 
equilibrium measure, thus IQelLa ^ 1, so that IQellLa ^ cos(^/2)"'^/^, quod 
erat demonstrandum. □ 

Now we are ready to state the main result of this section: 

4.8 Theorem. In a Maxwellian model, calling 7 the common value of all the 
measures 7^, the collision kernel Q, when restricted to the probability mea- 
sures, satisfies hypothesis ( f3.4l > with 

K= [1 -cos(e/2)'^ -sin(^/2)''] d7(^). (4.15) 
Jo 

Proof Note that 

Q{8o.-) = \ r Qe^{e) (4.16) 



2.0 

and apply all the previous work of this section (Lemmas 14. 1|, I4.3t 14. 51 and 1 4. 71) . 

□ 

4.9 Example. The "Kac" modefll is the case when the measure 7^ „, is always 
uniform on the sphere supporting it with total mass 1, i.e. it is a Maxwellian 
model with 

d7(e) = E(lzi) (sin ey-'^^B. (4.17) 

For that model one has — 00 < k < for any r G (0, 1). 

4.10 Example. The model of Maxwellian potential corresponds to particles 
having a repulsive force with a radially symmetric potential decreasing as 
p-{2d-2) ^j^g radius p between the two particles increases. For that model 
the measure 7 is not finite because then one has cfy ~ 9~^/'^<\9 when 9 — > 
for any d. Yet it remains possible to define both the A^-particle and the limit 
models, see Remark 11.11 (31 For that model, one also has —00 < k < for any 
r G (0,1). 



Actually this is not exactly the Kac model lfT2]| . but the spirit is the same. 
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5 Initial value 



In Formula (13. 6D given by Theorem I3.3|, as we saw, besides the dynamic term 
there is a term due to the fluctuations of the initial empirical measure. In 
this section we control these fluctuations in the case of i.i.d. initial particles. 

Let yu be a probability measure on R*^ and let r G (0, 1). We assume that n 
has an r-th exponential moment, i.e. that there exists some a > such that 

/ e'^l^'I'dt; < oo. (5.1) 

In the sequel we assume a to be fixed. 

If f is a random variable of R'^ with law fi, then 5„ — yu is a random variable 
in H~% whose law will be denoted by D^: is a centered probability measure 
on i?""*. I claim that has an exponential moment with parameter a, i.e. 

/ e"'''' d2)^(z/) < oo : (5.2) 

To prove it it suffices to note that 

\\5,, - ^ \\5y - 5o\\ + - = Cr\v - Vol'' + \\K - /^ll) (5-3) 
whose a-parameter exponential is integrable because of ( 15. ID . 

So the law has a finite exponential moment, a fortiori a finite variance. 
Let us denote it by a ^: 

a^= [ IWfdD.iu). (5.4) 

Now we have all the definitions at hand to state the main result of this 
section: 

5.1 Theorem. Let vq, . . . , wat-i be N i.i.d. random variables on K'^ with law ^, 
and denote fl^ = N^^ J2iLo^ their empirical measure. Then there exists an 
explicit constant A{fi), which is easy to bound, such that for all X ^ aN: 

E[U(X[r - nm < 2exp + ^) . (5.5) 



Before proving Theorem 15. Ij let us further examine Formula ( 15. 5D : the 
term in the exponential remains bounded when — > oo if A increases as 
N^/^, like in (13;^ . Thus, writing A = yN^/^ like in (13:281) : 

E[U{yN'/'ir-f^))]^2exp(^^ + ^^N~'/'y (5.6) 

Though we will not use it in the sequel, note the following 



19 



5.2 Corollary. For S ^ for all x ^ 0, for all N ^ No--= x^a^S^: 

P (i/^"^ ^ exp - 1^ + In 2 + A(/.)iVo^/2^-i/2^ . (5.7) 

5.3 Remark. ( 15. 7D works as soon as ^ x"^ /a?S'^, i.e. as soon as x ^ aSN^^"^, so 
that estimate is valid up to the large deviations setting. 

Proof of Theorem 15. 1\ The principle of the proof is exactly the same as for 
Theorem 13. 3|, except that here time will be discrete. 

Let vq, . . . ,vn-i be N i.i.d. random variables with law ^ and set Mj = 
^*~qA^~^(5^- — /i), then (Mj)j is a Markov chain and a martingale, and Mm 
has the same law as Jl^. We are going to prove that 

E[U{Xm+,)\m] ^ exp + ^^^^ ■ W(AM,), (5.8) 

whence the result. 

To get ( 15. 8D . thanks to Lemma [3^ it suffices to prove that 

I (..A-IH _ 7V-'A7V-'|M) dl>,(.) « exp + (5.9) 

We set 

= j (e^l'^l' - a||z/|| - l)dD^(z/). (5.10) 
The function 62 (t) = (e* — 1 — is convex on R+, so 

Vt^O V^G[0,1] e^*-^t-l ^ ^(l-^)^V + ^3(e*-t-l). (5.11) 
Consequently 

J (e-^^«^'-Ar-U||H|-l)dI)>)^ (l-AaiV-)|^ + ^, (5.12) 

whence dCT . □ 



6 Discussion 



6.1 Examples of synthetic results 

Until now in our article we have just given separate results, mainly Theo- 
rem [Ml Formulae (13.221) and (I3.24D . and Theorems |J8] and [O It is obvious 
that all these results are to be put together to get synthetic results on the 
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convergence of A^-particle dynamic models to their mean field limit; yet we 
have not done it in the previous sections. 

There are several reasons why I have postponed the presentation of such 
synthetic results to the last section. The most obvious one is that these global 
results would have been quite unreadable if put in the beginning of the article. 
More important, the different "bricks" of results given within the core of the 
paper are open to improvements different for each, some of which may work 
for some cases but not for others, so that there may be no optimal general 
result. 

Let us however give some examples of formulae got by piling our theorems 
together — proofs will not be given since they really consist in some plain glu- 
ing game: 

6.1 Theorem. Let d ^ 2, r e {0, 1). Let fio be a probability measure on with 
finite r-exponential moments for all r < 1. Up to translating the origin of R"' 
we can suppose that p-= Jj^^ fd/io(f ) = 0; then let k = \ j |f pd/io(f). Choose 
some ki > k and define 

K = 1 T{d-1) — [\in(e/2Ysm(eY-^deM, (6.1) 

2^-^T{{d-l)/2y Jo ^ ' ' ^ ^ 

i = 2i+V2i-'^ - lCrk[^\ (6.2) 
LJ = {2^'' -l)2^+^'Cr^k[, (6.3) 

(T^ = / \\6^ - fio\\ji-sdfio{v). (6.4) 



Let N ^ 2; let vq, . . . ^v^-i be N i.i.d. random variables with law /iq and 
let JIq be their empirical measure; denote = | Xlilo^ l^iP- V'f ^he 
empirical measure at time t of the Markov process with generator f fi.il > for 
the "Kac" model (i4.17i and initial condition (fo, . . . ,fAr_i). Let {fJ,t)t^o be the 
deterministic evolution f fi.gl ) for the same model with initial value fiQ. 

Then for any a > 0, there is a (easily bounded) constant A{a, fi) such that, 

for any T > 0, as soon as \ ^ ae~^'^^'^N: 



6.2 Corollary. For the same model, for any y ^ 0." 

7V"-I:;"oo L-A-^iVfti--V-- vr-r r^^./j ^ 



lim lnE[l^.^^,W(yiVV2(^f_^,))] ^ln2+e2NT^+ei(2|«:|T)^^. (6.6) 



* ■ ' Beware that n < 0. 
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6.3 Corollary. Still for the same model, for any x ^ 0; 

6.4 Remark. As (16. 7D is true for any value of ki, we can make ki approach k in 
it, which allows to replace lj by luq ■■= {2^-'' - l)2^+^'Cr'^k'' . 



6.2 Optimality 

Theorem 16. II essentially gives us a convergence to the continuous limit at rate 
AT 1/2 with gaussian control. Qualitatively it is the best result one can hope 
for, because it is the same way of convergence as for central limit theorems. 
Quantitatively however, is the parameter in the Gaussian bound optimal? 

Here we will look at what happens for Theorem 15.11 (Theorem 13.31 exhibits 
the same behaviour, but it is harder to see). Through Corollary 15. 2^ Theo- 
rem [53] gives some Gaussian bound in an infinite-dimensional frame. Yet its 
proof, whose main ingredient is the use of the utility function U, would work 
as well in a finite-dimensional setting. So let us imagine that we replace ij 
by and by the centered normal law with variance 1^, denoted by A/"; then 
(7^ becomes J^^ \x\'^(W{x) = d. In that case Jl^ turns into a random variable 
on H'^ which is centered normal with variance N^-^Id, and we get: 

P(X^ ^ a;Ar-i/2) ^ + 2 + ANo^^^N'^/^^ (6.8) 

for some A and Nq not depending on N, so making — > 00: 

^^i\X\ ^x)^ 2e-"'/2'i^ (6.9) 

whereas the exact result is 

m\X\ >x) = —— / y'-\-y'/^<ly ^ e--''\ (6.10) 
r(d/2) J,^ x^oo 

where "^^i" means "having equivalent logarithms". 

So for d > I the parameter in the gaussian bound is underestimeted by 
a factor d. Why that? Well, the proof of Theorem 15.11 uses the bound on the 
curvature of the utility function U given by Claim I3.1l l6l But as soon as x 
is reasonably large, the Hessian of W at x is much more curved in one direc- 
tion than in all the other ones, so that Formula (I3.14D in Lemma [3.41 becomes 
strongly suboptimal since the factor ||yp in it should morally be replaced by 
the sole component of the variance of y along the direction along which V^W(x) 
is most curved. 

So the techniques involving U are poor as soon as the dimension in which 
the random phenomena occur becomes large. For our particle models we work 
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in if"'', whose dimension is. . . infinite! Does that mean that our results are 
"infinitely bad"? Actually not, because each increment of the martingale M 
(see the proof of Theorem 15. ID is determined by the value of one Vi, so the 
law of this increment may be seen as a probability on BJ^. More precisely, the 
support of is isometric to R*^ equipped with the distance | ■ — ■ T , whose 
Hausdorff dimension is d/r, so that the "effective" dimension of ii in our 
theorem is d/r. 

As a consequence we had better not choose r too close to 0. On the other 
hand, the bigger s is, the more regular the test functions in the definition of 
" (see Proposition 12. 3D will be, so the less small-scale details || ■ \\h-o will 



catcho. So it should be advised to take medium values of r, e.g. r = 1/2. 



6.3 A numerical computation 

One important side of our work is that it gives non- asymptotic results. The 
idea behind it is that, to understand Boltzmann's evolution, we will not actu- 
ally look at N — > oo, but rather take some fixed large N and say that the be- 
haviour of the A^-particle system for that N is very close to the limit evolution 
with very large probability. In particular, think about the case of numerical 
simulation: we cannot afford dealing with 10^"^ particles on our computers! 

Here I will compute numerical values for the following case: the collision 
kernel is the one of "Kac" model for = 3 and we take fiQ = i + Physi- 
cally speaking, it means that we crash together two same-sized sets of frozen 
particles with relative speed 2. Then the collisions between particles of dif- 
ferents sets will tend to scatter the distribution of velocities of the particles, 
which will morally converge to the law dl.SD with k = 1/2 in a few units of 
time — this is the behaviour of Boltzmann's equation dl.GD indeed. The ques- 
tion is, which will we choose to be almost certain that the evolution of the 
particle system will be fairly close to ( I1.6D ? 

Say we take r = 1/2 and we want to have yUT||ij-» greater than e = 10~^ 
with probability less than q = 10^ for T = 3. As in our case ^ Nk almost 
surely, we take ki = k = 1/2. Then one computes the following numerical 
values, which are all rounded above: 

~ 0.600; (6.11) 
i ~ 0.432; (6.12) 
CO ~ 0.0933; (6.13) 

~ 0.0398. (6.14) 

We choose arbitrarily a = 1, then dS.lOD gives A{a,fi) ^ 0.0213. We have to 
take A > | \nq\/e, so let us put A = 500. For = 8 ■ 10^ we find by 

lnE[W(A(/2f -/it))] < 2.692, (6.15) 
Remember however that homogeneous Sobolev spaces have no inclusion relations. 
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thus 

P[\\l2^ -l^t\\>10-']<10-' (6.16) 

by Markoff 's inequality. 

So with a discrete system of 8 • 10^ particles one will much probably find 
a quite good approximation of Boltzmann mean field limit by running the 
particle system 3 units of time. Now simulating 8 ■ 10^ particles is easy for 
today's computers, which shows that our bounds can actually be useful in 
practice. However there is little doubt that the true speed of the mean field 
convergence is much faster than what our computations suggest. 

6.5 Remark. Here we have bypassed the problem of the 1 KN<Nki factor in ( 16. 5D 
by a specific argument. How can we do for it in the general case? Well, 
merely note that, as soon as one wants to have a result in terms of probability, 
they will just have to add P{K^ > Nki) to the probability they get forgetting 
the indicator. But the event {K^ > Nki} is a large deviations event, so as 
soon as fio has some square-exponential moment its probability will decrease 
exponentially with and thus cause no problem actually. 

6.4 Comparison to older results 

The "usual" method to tackle mean field limit problems relies on the concept 
of propagation of chaos, devised by Kac [12]: briefly speaking, one says that 
there is some "chaos" in a large assembly of particles if there is no correlation 
between the states of the particles, i.e. if the states of the particles are approx- 
imately i.i.dEl]. Then, if one manages to prove some propagation of chaos, one 
can study only the probability distribution of one particle to understand the 
behaviour of the whole assembly. That idea is linked with the concept of non- 
linear particle, due to Sznitman [20], which is a non-homogeneous Markov 
process describing the asymptotic behaviour of one particle when the number 
of particles becomes large, leading to a trajectorial version of propagation of 
chaos. 

Thanks to these tools Sznitman [flQfl proved propagation of chaos for spa- 
tially homogeneous Boltzmann models, thus giving a qualitative, asymptotic 
version of our Theorem 16. 1[ More recently Graham and Meleard [10] proved 
propagation of chaos for the more general Povzner equation, which is a kind 
of mollified spatially heterogeneous Boltzmann equation where each particle 
can collide with any particle in a small spatial neighbourhood around it. Yet 
these essentially qualitative methods hardly give quantitative results. 

My paper was motivated by the reading of [[H, in which BoUey et al. tackle 
some mean field limit problems in a quantitative way by working with Wi 
Wasserstein distances for the empirical measures. They get an explicit control 

Propagation of chaos may be seen as an attempt to make Boltzmann's Stojizahlansatz 
(cf. ISl) rigorous. 



24 



on the large deviations of the difference between the empirical measure of the 
A^-particle system and its theoretical limit for positive times. Yet there are 
two annoying shortcomings in their work: 

• First, it seems to be limited to McKean-Vlasov models, that is, systems 
where the interactions between particles are due to forces rather than 
collisions. In fact the proof fundamentally relies on a coupling technique 
(popularized by Sznitman ffT9l . Dobrushin [7] and others), in which one 
defines a coupling between the real assembly of particles and a virtual 
assembly of independent nonlinear particles. Such a technique has 
little relevance when one deals with collisions, because these events im- 
ply two particles at the same moment each time they occur, so there is 
no natural way of coupling. 

• Secondly, the results of fll are good for large deviations, but the control 
they give for medium deviations is far too poor to get, as we would wish, 
some N~^/'^ convergence rate. As we told in § 12. 1|, that is actually an 
intrinsic shortcoming of Wi distances. 

The idea of studying the empirical distribution of the particles in some 
Hilbert space to bypass the coupling problems is due to Fernandez and 
Meleard [9, JL4J, who wanted to study the problem of fluctuations of the par- 
ticle evolution, a problem linked to the uniform central limit theory (Si. Yet 
I did not know their work by the time I wrote this paper, and actually the 
framework here is quite different from that of [9]. To the best of my knowl- 
edge, my article is the first one to use Hilbert spaces in a collision model, and 
moreover in a non-asymptotic setting. 

To finish with this overview I must mention the current, independent 
work |fT5l| of Mischler, Mouhot and Wennberg, in which they devise a tech- 
nique studying directly the evolution of the empirical measure of the system 
in some Banach space. Their very general framework is liable to apply to a 
wide range of situations, though its quantitative version does not seem to give 
the optimal N^^/"^ rate of convergence. 

6.5 Uniform in time bounds 

The results we have given work for some fixed T, i.e. they control Wfi^ — htW- 
Yet it may be more natural to control sup^g^.T] ll/^f ~ fJ'th i-®- to say that the 
system is always close to the Boltzmann mean field limit between times and 
T, as [ 1 ] does for McKean-Vlasov models. We will not do it here, but note that, 
as we have used martingale techniques, getting results valid for all t e [0,T] 
can be easily achieved from the previous work by Doob's inequality IflGj § II- 
1.7]. Actually for k ^ it would turn out that uniform in time results are not 
much different from fixed time results, which is quite logical because then 
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the control on Wfi^ — is worst for t = T. For k > yet, when T is large the 
maximum of the difference between fl^ and jjt is much less well controlled 
than its terminal value, as we already noticed in Footnote ^\ at page [lH 



6.6 Going further 

I find my results rather disappointing in their present state, but I am still 
working on them, hoping to get substantial improvements, e.g. having cases 
for which k > or theorems applying to non-Maxwellian models. An impor- 
tant tool seems to be the choice of good functional spaces to work in rather 
than if % but it is still current work. 
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